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Cellular automata are investigated towards their ability to compute transductions, that is, to trans- 
form inputs into outputs. The families of transductions computed are classified with regard to the 
time allowed to process the input and to compute the output. Since there is a particular interest 
in fast transductions, we mainly focus on the time complexities real time and linear time. We first 
investigate the computational capabilities of cellular automaton transducers by comparing them to 
iterative array transducers, that is, we compare parallel input/output mode to sequential input/output 
mode of massively parallel machines. By direct simulations, it turns out that the parallel mode is 
not weaker than the sequential one. Moreover, with regard to certain time complexities cellular au- 
tomaton transducers are even more powerful than iterative arrays. In the second part of the paper, 
the model in question is compared with the sequential devices single-valued finite state transducers 
and deterministic pushdown transducers. It turns out that both models can be simulated by cellular 
automaton transducers faster than by iterative array transducers. 

1 Introduction 

Cellular automata have widely been investigated as a massively parallel computation model. In con- 
nection with the problems to recognize syntactical patterns, or in our terms formal languages, cellular 
automata have been considered in HI for the first time. Over the years substantial progress has been 
achieved in this field but there are still some basic open problems with deep relations to other fields (see, 
for example, 0). The results comprise the relations between parallel and sequential input mode, the im- 
pact of two-way and one-way inter-cell communication, the relation between different time complexities 
such as real time, linear time, or arbitrary time, the capabilities with respect to accept linguistic language 
families such as regular and context-free languages, closure properties, and decidability questions. 

Computational models are not only interesting from the viewpoint of recognizing some input, but 
also from the viewpoint of transforming an input into an output. For example, a parser for a formal 
language should not only return the information whether or not the input word can be parsed, but also 
the parse tree in the positive case. Here, two things are important: first the information whether the 
input is accepted, i.e., whether the input can be parsed, and second the production of the output, i.e., the 
construction of a parse tree if the word can be parsed. If the word cannot be parsed, the output is no 
interest. This motivation led to the investigation of models such as finite state transducers or pushdown 
transducers which are classical finite automata or pushdown automata where each transition is associated 
with some output. If an input is accepted, the result of the transduction is the output of the transitions in 
the order they have been applied. Both models have been studied in detail (see, for example, IUI21), and 
many applications such as in the context of parsing are known. 

Parallel transducers have been investigated in |3, 6]. In the first paper, one-way linear iterative arrays 
are introduced. In this model, the input is supplied to the leftmost cell, and the output is emitted at the 
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rightmost cell. In between there are as many cells as the input is long, the information flow is one-way, 
from left to right, and the output does not depend on the fact whether or not the input is accepted. In 
the latter paper, iterative array transducers are introduced, where the leftmost cell receives the input and 
emits the output. For the computational capacity of such devices, the time complexities for processing 
the input as well as for computing the output are crucial. It turned out that iterative array transducers 
with time complexity (real-time, real-time) are less powerful than those working in (real-time, linear- 
time). In turn, the latter are less powerful than iterative array transducers with time complexity (linear- 
time, linear-time). Moreover, it has been shown that deterministic finite state transducers as well as 
certain deterministic pushdown transducers can be simulated by iterative arrays with time complexity 
(real-time, real-time), whereas nondeterministic single-valued finite state transducers are simulated by 
iterative arrays in (real-time, linear-time). 

Here, we complement these results with investigations of the computational capacity of cellular au- 
tomaton transducers compared with iterative arrays and sequential devices. The detailed definitions and 
two examples are given in Section [2] The results on the computational capacities compared with itera- 
tive array transducers are obtained in Section [3] First, we present a construction which shows that any 
iterative array transducer can be simulated by some cellular automaton transducer preserving the time 
complexity as long as the time complexity is 'fair'. Together with the example of a transduction not 
computed by any iterative array in real time, we obtain as a consequence that cellular automaton trans- 
ducers are more powerful than iterative array transducer with respect to the time complexities (real-time, 
real-time) and (real-time, linear-time). For the combination (linear-time, linear-time) both devices are 
shown to be equally powerful. In Section|4] we compare the model in question with sequential machines. 
In particular, it is shown they can simulate single-valued finite state transducers and deterministic push- 
down transducers faster than iterative arrays. The former are simulated in (real-time, real-time) whereas 
the latter are simulated in (real-time, linear-time). 

2 Preliminaries and Definitions 

We denote the rational numbers by Q, and the non-negative integers by N. For the empty word we 
write A, the reversal of a word w is denoted by w R , and for the length of w we write \w\. For the number 
of occurrences of a symbol a in w we use the notation \w\ a . The set of all words over the alphabet A 
whose lengths are at most j > is denoted by A- J . We write C for set inclusion, and C for strict set 
inclusion. In order to avoid technical overloading in writing, two languages L and U are considered to 
be equal, if they differ at most by the empty word, that is, L \ {A} =L'\{X}. 

A (one-dimensional) two-way cellular automaton transducer is a linear array of identical determin- 
istic finite state machines, sometimes called cells, where each cell except the two outermost ones is 
connected to its both nearest neighbors. We identify the cells by positive integers. The transition of a 
cell depends on its current state and the current states of its neighbors, where the outermost cells receive 
information associated with a boundary symbol on their free input lines. The cells work synchronously 
at discrete time steps. 

The input/output mode for cellular automaton transducers is called parallel. One can suppose that 
all cells fetch their input symbol during a pre-initial step. Here we assume that each cell is additionally 
equipped with an output register that is initially empty and can be filled once by the cell. When all output 
registers have been filled, the transduction is completed. 

Definition 1 A cellular automaton transducer (CAT) is a system (S,F,A,B, #, 8), where 
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1. S is the finite, nonempty set of cell states, 

2. F CS is the set of accepting states, 

3. ACS is the nonempty set of input symbols, 

4. B is the Unite set of output symbols not including the special symbol _L, 

5. is the permanent boundary symbol, and 

6. 8 : (SU {#}) x S x (SU {#}) ->• S x (B* U {_L}) is the local transition function. 

A configuration of a cellular automaton transducer M = (S,F,A,B,#,8) at time t > is a descrip- 
tion of its global state, which can formally be described by two mappings c t : {l,2,...,n} — > S and 
o t : {1,2, . . . ,n} — > B* U {-L}, for n > 1, which map the single cells to their current states and to their 
output emitted, where _L means no output so far. The operation starts at time in a so-called initial con- 
figuration, which is defined by the given input w = a^2 • • • a n G A + , and no outputs. We set c^f 1 (?) = a, 
and £>o (0 = -L for 1 < i < n. Successor configurations are computed according to the global transition 
function A. For convenience, we write 8 S for the projection on the first component of 8, that is, the suc- 
cessor state, and 8„ for the projection on the second component, that is, the output emitted. Let (c t ,o t ), 
t > 0, be a configuration with n > 2, then its successor (c f+ i,o (+ i) is as follows. 

'8 s (c t (i-\),C t {i),C t (i+\)) if /G {2,3,. ..,72-1} 

Q+i(0 = \ 8 s (#,c t (\),c t {2)) if i=\ 

^8 s (c t (n-\),c t (n),#) if i = n 

For n = \, the next state of the sole cell is 8 s (#,c t (1),#). For o, we obtain 

o t (i) ifo ? (/)/_L 

8 (c t (i-l),c t (i),c t (i+l)) if o t (i) = _L and i G {2,3, 1} 
S (#, c, ( 1 ) , c t (2)) if o t ( i) = ± and i = 1 

8 a (c t (n — 1 ) , c t (n) , #) if Of (i) = _L and / = n 



o t +\{i) = < 



and, thus, A is induced by 8. As usual we extend A to sequences of configurations and denote it by A*. 
That is, A is the identity, A' = A(A _1 ), 1 < t, and A* = Uo<t Thus, (c t ,o t ) € A*(c,o) indicates that 
it is possible for M to go from the configuration (c,o) to the configuration (c t ,o t ) in a sequence of zero 
or more steps. 

An input w is accepted by a CAT if at some time step during the course of its computation the 
leftmost cell enters an accepting state. Transducer M transforms input words w G A + into output words 
v eB*. For a successful transformation M has to accept the input, otherwise the output is not recorded. 
Moreover, each cell has to emit an output: 

M(w) = v, 

if w is accepted by M, (c t ,o t ) G A*(cq W \oo), o t (i) ^ _L, 1 < i < \w\, and v = o t {\)o t (2) • • -o t {\w\). The 
transduction realized by M, denoted by T(M), is the set of pairs (w,v) G A + x B* such that M(w) = v. 

Letti,t : N — > N be two mappings. If for all (w,v) G T(M), the input w is accepted after at most f/(|w|) 
time steps, and o t (i) ^ ±, 1 < i < \w\, after at most ? (|w|) time steps, then M is said to be of time 
complexity (ti,t ) and we write CAT Wo . The family of transductions realized by CAT ?i;fo is denoted by 
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j7(CAT tfjto ). If tj or t Q is the identity function n, we call it real time and write rt. If f,-(n) or t {n) is of the 
form k ■ n, for some k G Q, > 1, we call it linear time and write /?. 

If we build the projection on the first components of T(M), then the cellular automaton transducer 
degenerates to a cellular automaton acceptor (CA). The projection on the first components is denoted 
by L(M). 

In order to clarify the notation we give two examples. The first example shows that CAT can copy 
their input in real time. The second example shows that CAT can sort a binary input in real time. 

Example 2 The transduction { (w,ww) \ w 6 {a,b} + } belongs to the family ^(CAT,-,,,t)- The basic 
idea is to use one register to shift the input from left to right and another register to shift the input from 
right to left. Additionally, two signals are started at both ends which cause the cells to emit the correct 
output (see Figure [T]). 

More detailed, let w = a\a2 ■ ■ -a n be the input and n be even. In the first time step, cell 1 emits 
a\a2 and cell n emits a n -\a n . In the next time step, cell 2 emits a^a^ and cell n — 1 emits a„-3a„-2- 
This is possible since the necessary information has been shifted to the corresponding cells and their 
neighborhood. Generalizing this observation to 1 < i < \, we obtain that cell i emits a2i-\a2i and cell 
n — i+l emits a n -2i+iQn-2i+2 at time i. If n is odd, the construction is similar. Now cells i and n — i+l 
emit a2i-\a2i and a n -2i+\an-2i+2 at time i, for 1 < i < [j\ . Finally, cell [|] emits a n a\ at time step [5] . 

□ 
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Figure 1: Schematic computation of a CAT rt rt copying input a\a2 •••a%. The step at which a cell has 
emitted its output is highlighted. 



Example 3 Here, we consider the transduction { (w,a^"b^ b ) \ w G {a,b} + } and show that it is com- 
puted by a CAT rt rt . The basic idea is that any two neighboring cells where the left cell carries a b and 
the right cell carries an a switch their contents. By this local transpositions, all a's will eventually be in 
the left part of the array whereas all b's will be in the right part (see Figure[2]). 

Clearly, on any input of length n the correct sorting has been achieved after at most n time steps. 
There is one problem to be solved: since the transpositions are only local, a cell cannot know whether its 
current content is final and has to be emitted. We can cope with the problem by synchronizing all cells 
at time n. The synchronization is realized by the well-known Firing Squad Synchronization Problem 
(FSSP), that is implemented on an additional track. In the first time step, two instances of the FSSP 
are started, one in the leftmost cell and the other one in the rightmost cell. First, let n be even. When 
the leading signals of both instances meet in the center at cells | and | + 1 , they are reflected and cells 
1 , 2, . . . , j and cells | + 1, | +2, . . . , n are treated as two separate instances of the FSSP. Since each FSSP 
can be set up to synchronize m cells in time 2m, we obtain that | cells are synchronized in time n. Since 
both FSSP work in parallel, we obtain that all cells are synchronized at time step n. The remaining case 
where n is odd is handled similarly. □ 
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Figure 2: Schematic computation of a CAT rt)rt sorting an input babaabba by local transpositions. The 
final synchronization is not depicted. 



3 Computational Capacity of Cellular Automaton Transducers 

This section is devoted to the computational capacity of cellular automaton transducers compared with 
the power of iterative array transducers as studied in 0. While CAT are arrays of finite state machines 
working in parallel input/output mode, iterative array transducers receive the input and emit the output 
sequentially by the leftmost cell, the so-called communication cell. 

Basically, an iterative array transducer (IAT) is a linear array of deterministic finite state machines, 
where each cell except the leftmost one is connected to its both nearest neighbors. The distinguished 
leftmost cell is the communication cell that is connected to its neighbor to the right and to the input/output 
supply (see Figure [3]). Initially, all cells are in the so-called quiescent state. At each time step the 
communication cell reads an input symbol and writes a possibly empty string of output symbols. To 
this end, we have different local transition functions. All cells but the communication cell change their 
state depending on their current state and the current states of their neighbors. The state transition and 
output of the communication cell depends on its current state, the current state of its neighbor, and on the 
current input symbol (or if the whole input has been consumed on a special end-of-input symbol). As for 
cellular automaton transducers, the cells work synchronously at discrete time steps. 

M203 • • • b m « — I 
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s . 



So ■ 



so 



0102^3 • • • O n <l 



Figure 3: An iterative array transducer. 



For details of later constructions, we provide the definition more formally. An iterative array trans- 
ducer (IAT) is a system (S,F,A,B, <,so, 8, 80), where S is the finite, nonempty set of cell states, FCS 
is the set of accepting states, A is the finite set of input symbols, B is the finite set of output symbols, 
<d ^ A is the end-of-input symbol, so € S is the quiescent state, 8 : S 3 — > S is the total local transition 
function for non-communication cells satisfying 5(jo,*o,*o) = *0> an d 80 : (AU {<}) X S 2 — > B* x 5 is 
the partial local transition function for the communication cell. The IAT halts when the transition func- 
tion 80 is not defined for the current configuration. Similar as for CAT, an input w is accepted when 
the communication cell enters an accepting state at some time t during the computation. The pair (w, v) 
belongs to the transduction computed by the IAT if w is accepted, the communication cell halts, and v 
is the total output emitted during the computation. The mappings ?, and t for the time complexities are 
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defined analogously to CAT,, fo . If t{ or t is the function n + 1, we call it real time and write rt. If ?,(n) or 
f (n) is of the form k ■ n, for some keQ,k>l,we call it linear time and write /?. 

Any transduction computed by a cellular automaton can be divided into two tasks. One is the ac- 
ceptance of the input, the other one the transformation of the input into the output. Both tasks have to 
end successfully in order to obtain a valid computation. On the one hand, this allows to modularize con- 
structions of cellular automaton transducers as both parts can be implemented independently on different 
tracks. On the other hand, this implies that a language, which is not accepted by any cellular automaton 
in time cannot be the projection on the first components of any transduction belonging to any class 
^(CAT f/ . t J. Unfortunately, it is a long-standing open problem whether there are languages accepted by 
two-way cellular automata in arbitrary, that is, exponential time but cannot be accepted in real time (see, 
for example, 0). However, the same observation applies to transductions computed by iterative arrays. 
So, we obtain the following theorem. 

Theorem 4 Let t a : N — > N, n < t (n), be a time complexity. Then there exists a language belonging to 
Zr(CAT rt st), but not to ST{lAT rtSo ). 

Proof The language 

L = {kxkk- • -toci$ji&- • -hyk& \ k > l,xf =)%• and yi,Zi G {a,b}*} 

is not accepted by any real-time IA [5]. However, it is linear context free. Since all linear context-free 
languages are accepted by one-way cellular automata in real time [7 ], the transduction { (w, a' vv ' ) | w G L } 
is a witness for the assertion. □ 

The previous theorem shows that there are transductions which cannot be computed by any iterative 
array that has to accept the input in real time. In fact, the limitation arises from the limitation to accept 
languages. This raises the question whether there are witness transductions whose projections on the first 
components are accepted by real-time iterative arrays, that is, the limitation is a limitation to transform 
the input in time. The next example answers the question for real time in the affirmative. 

Example 5 In [6] it has been shown that the transduction { (w,w R ) | w 6 {a,b}* } does not belong to the 
family ^(IAT rt .,-,). However, it can be computed by a CAT,, rt M as follows. Transducer M performs 
two tasks on different tracks in parallel. The first one is to synchronize the cells in real time as has been 
shown in Example [3] The second task is to reverse the input in real time. So, when the cells fire they 
emit their current input symbol in order to complete the transduction. 

The second task is computed by a cellular automaton M' = (S,F,A,B, #, 8} that itself uses two tracks 
which are implemented by the state set5= (AU{A}) 2 (see Figure Let (p\,qi), {p2,qi), and (773,^3) 
be arbitrary states from S. Then 

5 21 ),#) = (qi,pi), 

5(#,(p h qi),(p2,q2)) = (P2,pi), 
8({Puqi),(.P2,q2),#) = (92,91), and 

<5((/?l,9l),(/>2,92),(/?3,93)) = (P3,9l) 

shift the contents of the upper track to the left and the contents of the lower track to the right. Symbols 
arriving at the left end are copied to the lower track, and symbols arriving at the right end are copied to 
the upper track. In this way the input circulates. If M' is started with w G A + on its upper track and empty 
lower track, then the reversal of w is written on the lower track after in \w\ time steps, that is, when the 
FSSP of the first task fires. □ 
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Figure 4: Space-time diagram of a two-way cellular automaton reversing its input. 



Theorem[4]left open whether there is a proper inclusion between the transduction families or whether 
they are incomparable. Though real-time two-way cellular automata accept a strictly larger family of lan- 
guages than real-time iterative arrays, we cannot conclude that there is an inclusion between the trans- 
duction families. The reason is that iterative arrays receive their input sequentially to the communication 
cell and emit the output sequentially by the communication cell as well. So, when the last input symbol 
is read by the leftmost IAT cell, the last output is emitted also by the leftmost cell. However, for CAT 
this last output has to be at the right of the remaining output. Therefore, the usual simulation of an IA by 
a CA where the leftmost CA cell simulates the communication cell and the input is successively shifted 
to the left does not work. Nevertheless, the next result shows that the simulation is possible as long as 
'fair' time complexities are considered. Clearly, an iterative array transducer can emit an output up to 
the time complexity t , while in a cellular automaton transducer each cell can emit only one output. So, 
any time complexity t a larger than linear time yields a trivial transduction computed by an IAT but not 
by any CAT and, from this point of view, is 'unfair'. 

Theorem 6 Let tj,t : N — > N be two mappings so that f,-(n) <k\-n, t (n) < ki ■ n, for two constants 
1 < &i,&2- Then any transduction belonging to £?(IAT thto ) is computed by some CAT tuto . 

Proof Let M = (S,F,A,B,<l,so,8,8o) be an IAT, fo . By standard techniques M can be modified such 
that a cell never reenters the quiescent state after having left it, and that never more than n cells are non- 
quiescent on inputs of length n until the transduction is completed. The former property can be achieved 
by introducing a new state to which non-quiescent cells change instead of the quiescent state. The latter 
property is obtained by grouping max{&i,&2} cells into one. 

As mentioned above, the transduction computed by M can be divided into two tasks running on 
different tracks. Since any language accepted by a ? ; -time iterative array is known to be accepted by a 
?,-time two-way cellular automaton as well, it remains to be shown how to simulate the transformation 
of the input into the output by a CAT M' = (S' ,F' ,A,B,#,8'). 

Assume for a moment that &2 = 1, that is, t D is real time. Basically, the idea of the simulation (of the 
second task) is as follows (see Figures[5]and[6]>. Every cell of M' has five registers. In the first and second 
register, cells of M are simulated. So, they initially cany the quiescent state of M. At the beginning, the 
leftmost cell of M' simulates the communication cell of M for two time steps. Then the second cell of 
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M' simulates the communication cell for another two time steps, and so on. When cell i simulates the 
communication cell, then the concatenation of the first two registers of cells i, i — 1, . . . , 1 represent the 
states of the cells 1 , 2, . . . , 2i — 1 or 1, 2, . . . , 2i of M. In order to provide the necessary input symbols for 
the simulation of the communication cell, on the third track M' shifts its input to the left at every other 
time step. To this end, a modulo two counter is maintained in the fourth registers. When the end-of-input 
symbol meets the simulation of the communication cell in cell \n/2] , the simulation of M is completed. 
To conclude the idea of the construction the output has to be described. In the right half of the automaton, 
each cell passed through by the end-of-input symbol emits A . In the left half, each cell emits its output 
when it has simulated two steps of the communication cell. The output is the concatenation of the two 
outputs generated by the communication cell. To this end, the first output has to be remembered for one 
time step in the fifth register. Dependent on the parity of the length of the input, the last output possibly 
has to be emitted by a cell having simulated only one step of the communication cell. So, the output of 
the IAT is simulated in the left half of the CAT while the right half actually emits the empty word. 
Formally, the construction is as follows (see Figures[5]and[6]). 

5' = 5x5x (AU{<}) x {0,1} xB^ j , 

where j is the length of the longest output emitted in one step by the communication cell of M, and < is 
the end-of-input symbol of the IAT. On input w = a\ai---a n G A n , cell i is initially in state (so,so,aj,0, X). 
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Figure 5: Structure of CAT registers and denotation of their contents. Here all fifth registers are empty. 

A cell that detects that its left neighbor has just filled the first two registers, starts to simulate the 
communication cell for two time steps. Similarly, so does the leftmost cell at initial time. As before, 
8o, s (§s) denotes the first component of the value of 8q (8), while 8o. a (8 a ) denotes the second component. 
So, for pi,p2,qi / so, we define 

8'(#,(s ,so,i 2 ,0,X),(so,so,h,0,X)) = ({8o, s (i 2 ,so, s ), s , 12,1,80,0(12^0, so)), k), 
5'(#,(P2 5 J0,12,1,J3), (50^0,1*3,1, A)) = 

({8 s (p2,so,so),8oAh,P2,so),h,0,k),p8o,o{i3,P2,so)), 

8'((pi,qi,ii,0,X),(so,so,i 2 ,0,X),(so,so,i3,0,X)) = ((8o tS (i 2 ,q\,Pi),so,i2, 1, §o,o(h, qi,Pi)), k), 

8'{(p\,qi,h,l,X),{p 2 ,so,i2,\,fi),{so,so,hA,k)) = 

((8 s {p2,qi,Pi),8oAh,P2,qi),h,0,X),p8o, (i3,P2,qi))- 

A cell that already has simulated two steps of the communication cell continues to simulate cells of the 
IAT by applying the following transitions, where pi,qi,P2,q2 7^ so' 

8'(#,(p 2 ,q2,h,d2,k),(P3,q3,h,d 3 ,X)) = 

({8 s (p 2 ,so,so),8 s (q 2 ,P2,so),i2+d 2 , 1 -d 2 ,X),X), 
8'((pi,qi,h,di,X),(p2,q2,h,d2,A>),(p3,q3,h,d3,A>)) = 

{{8 s {p2,q\,P\),8 s {q2,P2,q\),i2+d 1 A-d 2 ,X),X). 
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Figure 6: Principle of a CAT simulating an IAT. The input is ci\a2 • • -a n and m is even. Depicted are the 
two consecutive configurations at time steps m and m + 1 for the IAT and for the CAT. 

Finally, the cells that did not simulate a step of the communication cell behave according to the following 
transitions, where p\ ^sq: 

8\{pi,s Q ,ii,\,X),(so,so,i2,\,X),(sQ,SQ,h,\,X)) = (Oo^o, «3,0, A), A), 
8'((s Q ,SQ,ii,0,X),(so,so,i2,0,X),(so,s ,h,0,X)) = ((s ,s , h, l,X),X), 

S'((pi,so,h,l,X),(so,so,i2,l,h),#) = ((sq,s q ,<\,0,X),X), 
8'((so,so,ii,0,X),(so,so,i2,0,X),#) = ((so, so, ?2, 1, X), X). 

This concludes the construction for the assumption = 1> that is, for t (n) = n. Now let constant &2 be at 
least two. In this case, the simulation is slightly modified as follows. Each cell simulates successively &2 
steps of the communication cell. In order to provide the correct input symbols the input is shifted to 
the left k2 — 1 times within &2 steps. To this end, a modulo &2 counter is maintained. The simulation is 
completed when the rightmost cell (cell n) of the CAT has finished to simulate the communication cell. 
Clearly, at that time ■ n steps have been simulated. Similar to the construction above, a cell emits its 
output after having finished to simulate the communication cell, and the output is the concatenation of 
the outputs computed during these &2 steps. □ 
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So, for the time complexities real time and linear time, the parallel input/output mode is not weaker 
than the sequential one. In fact, Theorems [4] and [6] imply that the former is strictly stronger for (rt,rt) 
and (rt,lt): 

Corollary 7 The family 37~(IAT rt , rt ) is strictly included in 3~(CAT rt ^), and ^(lAT rtM ) is strictly in- 
cluded in y(CATrt,it). 

Since the families of languages accepted by two-way cellular automata and iterative arrays in lin- 
ear time are known to be identical, the questions for the precise relations between ^(IAT; f r/ ) and 
^(CAT/ f rt ) or between ^(IAT/,^) and ^(CAT/,,/,) raise immediately. 

Proposition 8 The family 3?{lATi t rt ) is strictly included in 2F{CAT\ t n ). 

Proof The inclusion ^(IAT/ ? rt ) C ^(CAT/ ? rt ) follows again by Theorem [6] Moreover, if the trans- 
duction { (w,w R ) | w 6 {a,b}* } would be computable by some IAT/^, t , then it would be computed by an 
IAT rt rf , since the trivial input to be accepted is {a,b}*. However, by Example [5] this language separates 
the families ^(IAT rt>rt ) and ^(CAT rt , rt ) and, thus, it separates ^(IAT/ ?)rt ) and ^(CAT/ ?)rt ). □ 

For the last time complexity in question (It, It) we obtain a different situation. The parallel and 
sequential input/output modes are equally powerful. 

Theorem 9 The families ^{lAT h j t ) and ^(CAT^ ft ) are identical. 

Proof The inclusion j7(IAT/ f ,/ f ) C ^(CAT/^/,) follows once more by Theorem|6] 

Conversely, an IAT/, /, can simulate a CAT;, / f as follows. In a first phase, it reads the input and 
stores it successively in its cells. In a second phase, the iterative array transducer starts a FSSP in the 
communication cell, that synchronizes the n cells within 2n — 2 time steps. Finally, all cells start the 
simulation of the CAT at the same time. Clearly, the iterative array transducer obeys linear time bounds 
if the cellular automaton transducer does. □ 

The previous result can be generalized to arbitrary time complexities beyond linear time as long as 
the iterative arrays use linear space only. For space complexities beyond linear space, clearly, iterative 
arrays are stronger than cellular automata, since the latter are linearly space bounded by definition. 

4 Comparison with Finite State Transducers and Pushdown Transducers 

Here, we turn to compare cellular automaton transducers with finite state transducers (FST) and push- 
down transducers (PDT). These devices are in essence finite automata and pushdown automata, where 
each transition is associated with a possibly empty output word (see CD). In their most general form, 
FST and PDT are nondeterministic devices, that is, the partial transition function of an FST maps from 

5 x (A U {A}) into the finite subsets of S x B*. As above, 5 denotes the state set and A the input alpha- 
bet. The partial transition function of a PDT maps from S x (AU {A}) x G into the finite subsets of 
5x6* x G* , where G denotes the pushdown alphabet. Since a nondeterministic transducer may trans- 
form an input into different outputs, which is impossible for deterministic CAT, in the sequel we only 
study deterministic, unambiguous, and single valued devices. 

An FST Mis called single valued (SFST) if for all (wi,vi), (w2,V2) 6 T(M) either (wi,vi) = (w2,V2) 
or wi / W2. An SFST is said to be unambiguous (UFST) if for all (w,v) G T(M) there is a unique 
computation transforming w into v. Finally, a UFST is deterministic (DFST) if any computation is 
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deterministic. It has been shown in [9] that every single-valued finite state transducer can be simulated 
by an unambiguous one. Furthermore, it is known (see, for example, |9]) that 

^(DFST) C ^(UFST) = ^(SFST). 

The notions of single-valued PDT (SPDT), unambiguous PDT (UPDT), and deterministic PDT 
(DPDT) are defined analogously. Additionally, a UPDT is called real-time deterministic (DPDT^) if 
it is not allowed to move on empty input. The following proper hierarchy is known: (see, for exam- 
ple, @) 

^(DPDT A ) C ^(DPDT) C ^(UPDT) C ^(SPDT). 

In Q it has been shown that any DFST can be simulated by some IAT rt rt , and any SFST can be 
simulated by some IAT rt / f . Here, we prove that both devices can be simulated by some CAT as well. 
Interestingly, the device with parallel input/output mode can compute the transductions fast, in particular 
in real time, which is in contrast to the devices with sequential input/output mode. 

Lemma 10 The families ,3^ (DFST) and ^(SFST) are strictly included in ^(CAT rt , rt ). 

Proof We consider the transduction { (w,w R ) \ u G {a,b}* } of Example[5]which belongs to £^(CAT rt rt ), 
but clearly cannot be computed by any finite state transducer. Next, we describe how a CAT r , rf can 
simulate an SFST. Trivially, this construction applies to DFST as well. The idea of the simulation is 
similar to the construction for IAT, T /, given in J6[. However, here we can reduce the time complexity 
and, thus, have to cope with the problem of speeding up the computation to real time. 

Let M = (S,F,A,B,sq,S) be an unambiguous SFST. Due to a result in [9] we may assume without 
loss of generality that M does not move on empty input. First, from M a nondeterministic finite automa- 
ton Mnfa = {S,F,A,sq,8') is extracted that accepts L(M). Then, automaton Mnfa is converted into an 
equivalent deterministic finite automaton Mdfa by the powerset construction. 

Now we turn to the construction of a CAT rt rt M' which simulates M. Transducer M' has several 
tracks. The input is stored in the first register and, additionally, its second register is used to shift the 
input to the left in every time step. In the third register of the leftmost cell the deterministic finite 
automaton Mjjfa is simulated which receives its input on the second track. Now M' accepts if and only 
if Mqfa accepts the input w at time \w\. 

The second task of M' is to compute the output. For this purpose, the unique accepting computation 
of Mnfa has to be identified among all computations on w. As a first step, automaton Mnfa is converted 
into an equivalent right linear grammar Gnfa with axiom X. The productions of Gnfa have three different 
forms: 

1. X — > a[q'] for all transitions q' G 8'(so,a) with a G A, 

2. [q] — > a[q'] for all transitions q' G 8'(q,a) with q G S,a G A, 

3. [q] — > a for all transitions q' G 8'(q,a) with q G S, q' G F, and a G A. 

So, every production in Gnfa corresponds to a transition rule in M^fa and M and, thus, corresponds to 
an output u G B*. 

Let w = a\a,2 • ■ -a n . We consider sets Vi,Vz,... ,V n of nonterminals from Gnfa so that Y G V,-, if and 
only if there is a derivation Y =^>* aiai+x ■■•a n in Gnfa- Set V„ includes exactly all nonterminals Y for 
which the production Y — > a n belongs to Gnfa- In general, for 1 < i < n, the set V,- includes exactly all 
nonterminals Y for which the production Y — > a{Z belongs to Gnfa and Z G Vi+\. Clearly, set V, can be 
computed from a,- and V/+i. 
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Let us assume for a moment that n is even. The next construction step is to set up M' so that 
V\,V2,...,V n are computed in the cells S + l, 5+2, . . . ,n within | time steps. To this end, on an additional 
track the input is shifted to the right in every time step. Moreover, in the first step the rightmost cell 
computes the sets V n -\ and V n with the knowledge of and a n . In the next time step, cell n — 1 
computes the sets V„_3 and V„_ 2 with the knowledge of a n _3, a n -2, V n -\, and V n . In general, cell n—i+1 
computes the sets V„_ 2 (!-i)-i and V„_ 2 (;-i) m ti me ste P J with the knowledge of a n _2(i-i)-u £Z «-2(i-i)» 
V„_ 2 ((-i)+i> and V ;i _2(i-i)+2- Additionally, the symbols a„-2((-i)-i and a„_ 2 (i-i) are stored in another 
two registers. Thus, at time step 5 the sets V\ and V 2 are computed in cell | + 1. (see Figure [7] for an 
example). The case when n is odd is handled similarly. Then, the sets Vi, V2, ■ ■ ■ , V n are computed in the 
cells [~|] , [|] + 1, . . . ,n within [~|] time steps. 

From the sets V/ now the unique accepting computation path of M is extracted. Clearly, w S L(Gnfa) 
if and only if X G V\. Moreover, there is only one production of the form X — > a\Z\ in V\. Otherwise 
the accepting path would not be unique. For the same reason there is only one production of the form 
Zi — > a 2 Z 2 in V 2 , and so on. Let us again assume for a moment that n is even, and let the unique sequence 
of productions that derive w be pi,P2, ■ ■ ■ ,Pn- At time step 5 + 1, the cells j and § + 1 determine the 
productions p\ and /? 2 . This is possible, since both cells can identify themselves in time step S + 1, and 
all necessary information is available in the cells and their neighborhoods. Additionally, the productions 
computed are sent to the left on an additional track. Furthermore, in cell | + 1 a signal R is sent to the 
right. In the next time step, cell 5 + 1 determines p^ and by signal R cell ^ + 2 is caused to determine 
P4. Both productions are subsequently shifted to the left. In general, R arrives at cell | + i at time step 
2 + i and causes cell | + i — 1 to determine p2i-\ and cell | + i to determine p2i- Again, all necessary 
information is available in the cells and their neighborhoods. The case when n is odd can be handled 
similarly again. In this case, the productions p\,p2,...,p n are computed in the same way in the cells 
[5] — 1 , [S] , . . . ,n and similarly are sent to the left. 

By construction, production p\ reaches cell i at time step n, for 1 < i < n. At this moment, the output 
Ui G B* associated with the transition rule that led to the definition of the production has to be emitted 
in cell i. So, it remains to be ensured that all cells are synchronized at time step n. As is described in 
Example [3] this can be achieved by simulating an FSSP on another track. 

Altogether, we obtain that M' simulates M, accepts and emits the output in real time. Thus, SFST M 
is simulated by a CAT rf , rt . □ 

The next result follows from known results on IAT rt rt and the simulation of IAT by CAT as presented 
in Section [3] It is worth mentioning that ^(SFST) and ^(DPDT^) are incomparable [6]. Here, we 
obtain that both classes are included in j7(CAT, t 

Lemma 11 The family ^{DPDTx ) is strictly included in £?(CAT rtJ - t ). 

Proof The assertion follows from the fact that ^(DPDT A ) is strictly included in ^(IAT rt!rt ) (|3) and 
that ^(IAT rf , rt ) is strictly included in ^(CAT rt rf ) due to Corollary|7] □ 

Finally, we will show that any DPDT can be simulated by some CAT,-, j t . This is again an improve- 
ment in comparison with IAT. It has been shown in [6] that any DPDT can be simulated by some IAT/, /, 
which in turn can be simulated by some CAT/, /, owing to Theorem[9] Here, we obtain that the simulation 
can already be achieved by some CAT r , /,. 

Lemma 12 The family ,^{DPDT) is strictly included in ^(CAT rtJt ). 
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Figure 7: Schematic computation of a CAT rf rt simulating an SFST on input ai«2 • • - as- m the first four 
time steps, the sets Vi,V2,..-,V% are computed in cells 5, 6, 7, and 8. Thus, an accepting path has been 
stored in the last four cells which is extracted in the last four time steps and the corresponding output of 
the transitions is distributed to the correct cells. The simulation of Mjjfa and the synchronization is not 
depicted. 

Proof The transduction T = { (ww,wc' w \) \ w G {a,b} + } cannot be computed by any pushdown trans- 
ducer, since the language { ww | w G {a,b} + } is not context free. 

On the other hand, transduction T can be computed by a CAT rt rt and, thus, by a CAT rt /,. To this 
end, as in Example[3]two instances of the FSSP are initiated at both ends of the array that cause each cell 
to fire at time n. Firing of a cell in the left half means to emit the original input symbol and firing in the 
second half means to emit symbol c. Since the language {ww \ w G {a,b} + } is accepted by a real-time 
CA this shows T G ^(CAT rt , rt ). 

Given a deterministic pushdown transducer M with state set S and pushdown store alphabet G, we 
next construct a CAT,., j t M' simulating M. 

There is a constant k\ > such that M cannot push more than k\ pushdown symbols in one time step, 
and M can pop at most one pushdown symbol in one time step. Moreover, there is a constant ki < \S\ ■ \G\ 
such that M cannot perform more than &2 subsequent moves on empty input. From these facts follows 
that M works in linear time. Let k = max{&i , £2}- 

Basically, the CAT rt \ t M' computes five tasks on different tracks. On the first track, a deterministic 
pushdown automaton is simulated in real time that accepts the language L(M). The details of such a 
simulation can be found in fl4[. 

It remains to be shown how M' computes the output of M. The second track is used as follows. The 
leftmost cell simulates the state transitions of M while on request the other cells shift the input to the 
left, thus, providing the input for the leftmost cell. In detail, when the simulation of M consumes an 
input symbol, a signal is sent to the right which causes the cells to shift the input one position to the left. 
Otherwise, when M simulates a transition on empty input no signal is sent. 

The pushdown store of M is simulated on the third track. In [|4l it has been shown how to simulate 
the data structure pushdown store without loss of time. Since here at most k symbols are pushed in one 
time step, the simulation can be realized by grouping k pushdown symbols together. 

On the fourth track, a data structure queue is implemented as is also shown in 0. The leftmost cell 
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stores transitions simulated on the second track into this queue. Since at most k consecutive transitions 
are on empty input, grouping at most k + 1 transitions into one symbol to be stored ensures that any 
symbol in the queue represents at least one transition consuming an input symbol. So, at most as many 
symbols are stored as the input is long. Since M works in linear time, it is not difficult to see that all 
these tasks are simulated by M' in linear time as well. 

The final task is to emit the output. After acceptance of the input, on the fifth track a signal is 
started from the leftmost cell to the right which provides sufficient time so that all symbols are properly 
stored in the queue. Having reached the rightmost cell, the signal changes its direction and moves 
back to the leftmost cell. On its way it causes each cell passed through to emit u\U2 •••u m G B*, if it 
stores (pi,p2,.--,Pm) in its fourth register (the queue register), where Uj is the output associated with 
transition pu I <i < m. If the fourth register is empty, the cell emits X. When the signal arrives at the 
leftmost cell again, the transduction is completed in linear time. □ 
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